On efficient estimators of the proportion of true null hypotheses in a multiple testing setup

Authors

  • Van Hanh Nguyen
  • Catherine Matias
Abstract

We consider the problem of estimating the proportion θ of true null hypotheses in a multiple testing context. The setup is classically modeled through a semiparametric mixture with two components: a uniform distribution on the interval [0, 1] with prior probability θ, and a nonparametric density f. We discuss asymptotic efficiency results and establish that two different cases occur, depending on whether or not f vanishes on a set with non-null Lebesgue measure. In the first case, we exhibit estimators converging at the parametric rate, compute the optimal asymptotic variance, and conjecture that no estimator is asymptotically efficient (i.e. attains the optimal asymptotic variance). In the second case, we prove that the quadratic risk of any estimator does not converge at the parametric rate. We illustrate these results on simulated data.
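
To make the first case concrete, the following minimal sketch simulates p-values from the mixture θ·U[0, 1] + (1 − θ)·f and estimates θ from the fraction of p-values above a threshold. It is an illustration only, not the authors' estimator: the choice of f as the uniform density on [0, λ] (so that f vanishes on (λ, 1]), the threshold λ = 0.5, the sample size, and the function names `simulate_p_values` and `threshold_estimate` are all assumptions made for this example. Because f puts no mass above λ, P(X > λ) = θ(1 − λ), so rescaling the empirical fraction by 1 − λ gives a simple estimate of θ.

```python
import random


def simulate_p_values(n, theta, lam, rng):
    """Draw n p-values from theta * U[0, 1] + (1 - theta) * U[0, lam].

    The alternative density f is assumed (for illustration) to be uniform
    on [0, lam], so it vanishes on (lam, 1].
    """
    p_values = []
    for _ in range(n):
        if rng.random() < theta:
            p_values.append(rng.random())        # true null: uniform on [0, 1]
        else:
            p_values.append(lam * rng.random())  # alternative: supported on [0, lam]
    return p_values


def threshold_estimate(p_values, lam):
    """Fraction of p-values above lam, rescaled by the length of (lam, 1]."""
    above = sum(1 for p in p_values if p > lam)
    return above / (len(p_values) * (1.0 - lam))


rng = random.Random(0)   # fixed seed so the run is reproducible
theta_true = 0.8         # assumed proportion of true nulls
lam = 0.5                # assumed threshold; f vanishes above it
p_values = simulate_p_values(20000, theta_true, lam, rng)
theta_hat = threshold_estimate(p_values, lam)
print(f"theta_hat = {theta_hat:.3f} (true value {theta_true})")
```

In this setting the count of p-values above λ is binomial, so the estimate fluctuates around θ with standard deviation of order 1/√n. If f were positive on all of (λ, 1], the same rescaled fraction would overestimate θ, which is where the second case of the abstract applies.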

Similar resources

Generalized estimators for multiple testing : proportion of true nulls and false discovery rate

Two new estimators are proposed: one for the proportion of true null hypotheses and the other for the false discovery rate (FDR) of one-step multiple testing procedures (MTPs). They outperform existing such estimators when applied to discrete p-values whose null distributions dominate the uniform distribution and reduce to leading such estimators when applied to continuous p-values. For the new...

Post hoc power estimation in large-scale multiple testing problems

BACKGROUND The statistical power or multiple Type II error rate in large-scale multiple testing problems as, for example, in gene expression microarray experiments, depends on typically unknown parameters and is therefore difficult to assess a priori. However, it has been suggested to estimate the multiple Type II error rate post hoc, based on the observed data. METHODS We consider a class of...

Estimating the proportion of true null hypotheses, with application to DNA microarray data

We consider the problem of estimating the proportion of true null hypotheses, π0, in a multiple-hypothesis set-up. The tests are based on observed p-values. We first review published estimators based on the estimator that was suggested by Schweder and Spjøtvoll. Then we derive new estimators based on nonparametric maximum likelihood estimation of the p-value density, restricting to decreasing an...

An adaptive significance threshold criterion for massive multiple hypotheses testing

This research deals with massive multiple hypothesis testing. First regarding multiple tests as an estimation problem under a proper population model, an error measurement called Erroneous Rejection Ratio (ERR) is introduced and related to the False Discovery Rate (FDR). ERR is an error measurement similar in spirit to FDR, and it greatly simplifies the analytical study of error properties of m...

Estimating the proportion of true null hypotheses when the statistics are discrete

MOTIVATION In high-dimensional testing problems, π0, the proportion of null hypotheses that are true, is an important parameter. For discrete test statistics, the P values come from a discrete distribution with finite support and the null distribution may depend on an ancillary statistic such as a table margin that varies among the test statistics. Methods for estimating π0 developed for continuo...


Publication date: 2013